RAW SEQUENCE LISTING 
ERROR REPORT 



BIOTECHN ULOGY 
SYSTEMS 
BRANCH 




The Biotechnology Systems firanch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Source: 

Date Processed by STIC: 



/6sZ 



/2-29-oa 



THF ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

ro R N SS^^QlI™ONS. PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ™™^A* 703-308-4216. 
PATENTIN 2.1 e-mai. heip: P atin21 helpQuspto.gov or phone 703-306-4 9 R. Wax 
PATENTIN 3.0 e-mail help: pstin3nheln@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
w.STo15o^M, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1 .825 eff ective October 1 1990 (old 
mles) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in ^^^J*™ 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTOV 
Use of Checker prior to filing the sequence listing is expected to result m fewer errored sequence 
listings, thus saving time and money. 

rh.rkPr Version S 0 ran he down loaded from the TISPT O website at the following address : 

http://www.uspto.gov/web/offices/pac/checker 
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1652 



RAW SEQUENCE LISTING DATE: 12/29/2000 ^rtrfiOW A 

PATENT APPLICATION: US/09/478, 188 TIME: Jl:55:07 K\q\ ^ 0 T\ oftd© G 

input-. Set : A:\M-B960-l.app v *oC\0\ S 

Output Sen: N:\CRF3\12292000\I478188.raw . ^q^^ C ^ 

3 <110> APPLICANT: Shen, Ben O. 

4 Liu, won C6L^ 1 

5 Christenson , Steven D . ^ 

6 standaqe, Scott 

B <:.120> TITLE OF INVENTION: GENE CLUSTER FOR PRODUCTION OF THE ENEDTYNE ANTITUMOR 
9 ANTIBIOTIC C- 1 027 

11 <130> FILE REFERENCE: 2 5 00.128US1 

13 <14 0> CURRENT APPLICATION NUMBER : 09/478188 

14 <1 4.1 > CURRENT FILING DATE ; 200 0-01-0 5 



1 6 


<150> 


PRIOR APPLICATION 


NUMBER 


: 6 0/1154 3 4 




17 


<151> 


P R I OR F I L 1 N G D AT E : 


19 99- 


01-06 






1 9 


<160> 


NUMBER OF SEO ID NOS : 10 








21 


<17 0> 


S O V TW A RE : P a L e 1 1 1 1 n 


Ver . 


2 . 1 






23 


<210> 


S EO 1 1-1 NO : 1 










2 4 


<211>- 


LENGTH: 4 2 000 










2 5 


<2.12> 


TYPE: DNA 










2 6 


<213> 


OR CAN I SH : A r t: i f: i c i a 1 S e queue e 






28 


<220> 


FEATURE : 










29 


<2 2 3> 


OT H E R I N F O RMAT 1 0 N : 


Descr.i pt i.on of 


Artificial Sequence 


31 


<2 20> 


FEATURE : 










3 2 


<2 2 3> 


OTHER I NFORMATION : 


orf ; 


relative 


posi 1 1 on 


6 58-11 


34 


<2 20> 


FEATURE : 








1478-930 


35 


<2 2 3> 


OTHER INFORMATION : 


orf; 


re la five 


position 


37 


<22 0> 


FEATURE : 










30 


<223> 


OTHER I N FOR MAT ION : 


orf; 


relat. i.ve 


posi t ion 


2 713-164 9 


40 


<220> 


FEATURE: 










41 


<2 2 3> 


OTHER INFORMATION : 


orf; 


re la Live 


position 


3238-2851 


4 3 


<2 20> 


FEATURE : 










4 4 


<223> 


OTHER INFORMATION : 


or f ; 


rel at i.ve 


posi t ion 


4971-3442 


4G 


<.2 20> 


FEATURE : 










4 7 


<2 2 3> 


OTH E R I NFORMAT ION : 


or f ; 


relative 


posi tion 


5982-74 78 


4 9 


<220> 


FEATURE : 










50 


<2 23> 


OTHER INFORMATION : 


orl ; 


relat i. ve 


posit ion 


9900-7573 


52 


<2 2 0 > 


FEATURE : 










5 3 


<22 3> 


OTHER I NFORMAT I OH : 


orf ; 


relative 


pos i. tion 


1134 9-998 2 


5 5 


<220> 


FEATURE : 










5 6 


<2 23> 


O TH E R I N FOR HA 'J 1 1 0 N : 


orf ; 


re 1 at i ve 


posit ion 


28590-29588 


5 8 


<2 20> 


FEATURE : 










59 


<2 2 3> 


OTHER INFORMATION: 


orf ; 


relat.i ve 


position 


296 32-31197 


61 


<220> 


FEATURE : 










62 


<2 23> 


OTHER I N FORMAT I OH : 


orf ; 


relative 


posi tion 


31280-32590 


6 4 


<2 20> 


FEATURE: 








32809-34 39 2 


6 5 


<223> 


OTHER INFORMATION : 


ort"; 


rel at i ve 


position 


6 7 


<220> 


FEATURE : 










68 


<223> 


OTHER INFORMATION : 


ort ; 


rel at i ve 


posit i.on 


3 5274-344 58 


7 0 


<2 20> 


FEATURE : 










7.1 


<223> 


OTH E R I N FORMAT ION : 


orf ; 


relat i. ve 


position 


1 7924.-16653 



f ile://C:\CRF3\Outhold\V srI478 1 88.htm 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/4 78,188 



I) AT lil : .12/29/20 00 
TIME: 11:55:07 



73 
74 
76 
7 7 

79 
80 
82 
S3 
85 
86 
88 
89 
91 
9 2 
9 4 



9 8 

I 0 0 
101 
103 
104 
'1 0 6 
107 

10 9 
110 
i 12 
113 
115 
i 16 
! 18 
119 
121 
122 

1.2 4 
1 25 
127 
128 
"I 2 9 
:i 3 0 
131 

1.3 2 
133 
134 
135 
136 
137 
138 
139 



Input Set : 
Output Set: 

<2 2 0> FEATURE : 

<223> OTHER INFORMATION : 

<220> FEATURE: 

<2 2 3> OTHER I INFORMATION : 

<2 20> FEATURE- : 

< 2 2 3 > OTHER 1 NFO RM AT I O N : 

<2 20> FEATURE : 

<22 3> OTffER IN FORMAT TON : 

<2 20> FEATURE : 

<2 2 3> OTHER INFORMATION: 

<2 20> FEATURE: 

<2 2 3> OTH ER T N FORM ATT ON : 

<2 20> FEATURE: 

k22 3> OTHER INFORMATION; 

■,220> FEATURE: 

< 2 2 3 > O T H E R T N FORM AT I O N : 

<2 20> FEATURE; 

<2 2 3> OTHER I NFORMATI ON ; 

<220> FEATURE; 

--:22 3> OTHER INFORMATION: 

<22 0> FEATURE; 

<22 3> OTHER INFORMATION: 

<2 2 0> FEATURE: 

<223> OTHER INFORMATION: 

<22 0> FEATURE: 

<2 2 3> OTHER INFORMATION: 

<22 0> FEATURE : 

<2 2 3> OTHER INFORMATION: 

<2 2 0> FEATURE; 

<223> OTHER INFORMATION: 

<2 2 0> FEATURE: 

<2 2 3> OTHER INFORMATION: 

<2 20> FEATURE; 

< 2 2 3 > OTH ER I N FORMAT ION : 

<2 2 0'> FEATURE: 

<22 3> OTHER INFORMATION: 

•v4 00> SEQUENCE: 1 

q t eg tic t c L a g a g g a r. c c c q 

c g a c q a c t g r : ggcg a a g g g c 



A:\M-8960-l.app 
N:\CRF3\1229200 0\I4 7818 8 .raw 



or f ; ro la t ive pos i t ion "16653-15919 
orf; relative- position 159 22- 14 690 



o r f ; re I a 1 1 v e p o sit i on 1 4 6 4 3- 14 2 1 2 



orf; relati 



position 1.30 12- 14 0 79 



orf; relative position 12835-11351 
orf; relative position 25564-24986 
orf; relative position 24702-23566 
orf; relative position 22878-21424 
orf; relative position 21.407-19926 
orf; relative position 19929-1926 7 
orl; relative position 19191- 1 80 31 
orf; relative position 35938-35516 



re.l at ive 



27214 -28593 



posit ion 

orf; relative position 25815-27170 

or f ; re 1 a t i ve pos i t ion 23 546-22875 

orf ; re I a t ive pos it ion 3 5 2 7 A - 3 4 4 5 8 

orf; relative position 37 559-389 38 

orf; relative position 4 0 986-3936 7 



ggtqcggagt 
ggttcottga 



o c g c g t c g a g g a f c t g c g t g 

g g g t g a g go c o c t g a c g g t. c 

c c c c g t c ego 1 1 cc a c a a g g 

ggge.g tegag gtagtcctgg 



agq/gg t tacg 
gttcgaggec 
teggggageg goccagggog 
acctcgaagc agogg fog tg 
a o g a c g c c g g g a o a g g a o t c 
a a g a t g c g g o g g g g g g o g g g 



g a e g a a g g a g g g g I . g c o c g g 
g q t g g o g a g g a e g a c g t g g t 
oagoecotcg gtcaggtacg 
g g a o c g g g c g t c g a g c g c c t 
c c g t g c g g c c t c g a c o a g t c 
g c c o t g t : t c g g t g a a e 1 1 c c 



aegaagecca gegceggggc oagtcgcgcc ggtoggcofc etggttggcc eagftgatga 

agtegageae gtcctogegg aaeaeegaca tcctgccggc ctggatatfg aagacgtggt 

ceeaggggtt gc eg tea egg tgataggega egccggccga gcggtaggcg gcgcgocgct 

ceaggaggae gaettcoagc ggtcttcteg cgaaatgaag eaggegtafe gcggtcgocg 

tgocfgccag gcocgococt aogaccagca ocotggggog ogcacccgtc atgcooafga 

agoctcocoe gctgaotcag ggcggogcgf egogogctee cgtoggtgte cfegetgact 



60 
1 20 
18 0 
24 0 

3 00 
36 0 
420 

4 80 
54 0 
600 
6 60 
720 



file:/7C:\CRF3\Outhold\VsrI478 188.htm 
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RAW SEQUENCE LISTING 

PATBNT APPLICATION: US/09/4 7 8 , 18 8 

input Set : A:\M-8960-l.app 

Output Set: N:\CRP3\12292000\I4 7 8188.raw 



DA Tii 1 . : .12/29/2000 
TIME: '11:55:07 



14 0 
141 
14 2 
1 4 3 
144 
14 5 
14 6 
14 7 
148 

14 9 
150 
151 
1 5 2 
1 5 3 
154 
155 
156 

15 7 
1 5 8 
159 

1 6 0 
! hi 
162 
16 3 
16 4 
1 6 5 
1. 6 6 
.1 6 7 

16 8 
i 6 ') 
1.70 
1 71 

173 
174 
1 7 5 
1 7 6 
177 

17 8 

1 7 9 

18 0 
10.1 
1 82" 
IB 3 
18 4 
ID. 5 
186 
13 7 
18 8 



gyaagttccc 
cygg tcaggt 
agcegcyecc 
agcetgtccg 
teg a tot egg 
acgaagacc t 
gcg tea y acq 
age tteaegt 
cccgegacct 
cccaegycya 
gctccgytqa 
a tg t tgagca 
ggtgcggtgt 
atgggagttc 
gccctceygc 
gcaeca ccgq 
tgccccgaac 
eg gcg a a egg 
ggyeggegag 
ggtegg tcac 
ggacccgct.c 
ggaagagctt 
cgccgccggg 
ggacttcgtt 
gggacagceq 
c teggatg tc 
cctcgatggg 
teagggggeg 
ageggaaggg 
tga. cage gec 
go ttggactg 
tgtccgcgac 
ca tgcgcygc 
tgqcgatygt 
tgattgeege 
gtatgggggg 
ggtgcccagc 
gtcgaeggoe 
atgggggagc 
gtcggccccg 
ettgegcegg 
gagccacacg 
ggegacctet 
tegtctgegg 
geaegggcag 
gtgcctccgg 
ccggccgttg 
gytygcgaay 
g taygacy tg 



tgaectgycy 
eg tgoacyat 
gygyctcyac 
ggggcgtgcg 
tgcgggcgt.a 
eygegayctc 
cctegeccac 
cytogee g qg 
tegecgacac 
ggqega tcg<j 
teg egg eg ta 
ct cgcccgaa 
eg a t- ggtgtg 
cteg tecctc 
ccc ttctaqg 
gagecccgag 
acq gqecteg 
gyagaagecq 
cagyatgegg 
ccc qagqaay 
gggqtccgct 
gygcagcaq t 
gcaggtgtgt 
y ttyagqgcy 
cccctcqgtg 
ggc tteggee 
agtggcggqq 
gtecgtgayo 
cccttgggtg 
gtegggegag 
ttcaecgt.ee 
ggcctgt tec 
aagggcgtgc 
cat gg cog a a 
ttt tteaggg 
eyygaygagc 
tqggaqcggg 
tctccyggyy 
aggaagaaga 
ay t. tcga tyt 
ggggtcttgt. 
acctt.eycct 
ccttcgtcgg 
cggtgggocg 
gtacg tcctg 
tcygayyacg 
gat ttgatca 
geaegggegg 
tagagaaggc 



tcaactceac 
eg tgg caeca 
qqqqcgggyc 
t.gcqgggcgt 
gtggttgaag 
ggtgtcegtc 
ttcyccygeq 
cgteccccy y 
cgtgtgcgcc 
e tegegty tg 
ggt t tceagg 
ccge ttetec 
gaegqqaate 
cagtctgcec 
cay g teg ccc 
qggcgaggte 
a Lot tggcqa 
c-aq tegtege 
tcgegtacet 
aegegqqegy 
tegccggcca 
teggegtagt 
acgccgatgc 
d tga a g teg t 
a aqtegaqct 
tegtcgyega 
tagaggaggc 
tgccgtgcgg 
atgctgggga 
agggtgtcga 
acgay gaegg 
tgetgtttgg 
agyagtgtcg 
gay tagggaa 
gaag t t.gat.g 
Ctgcqggqtt 
ggytettttc 
eacet tgccg 
cceggegecg 
a g cog a teat 
ccagggcetg 
cgtgaacyag 
eg tgcaoegg 
cagtygtgcg 
gggcaetca c 
ttcattcgtc 
tgteggcagg 
tcccggggcg 
cctg ttcgac 



tga tecg taa 
gacaga tcac 
accggca.ggg 
cagctqtcqa 
tagt I )(.) tqt 
catccctgtg 
a tctcectgg 
cgaa tegeca 
gectgycagt 
eg qg eg teg a 
accaeggggg 
agtcygegea 
egcggcatgg 
aagcaectcc 
ygtgytycyg 
agaggcegag 
aggecaggtc 
aggtteeeag 
gctegggggt 
caggygycay 
g ttcgagata 
egaty tcyag 
gggeggtttc 
cyaggacgee 
ggacca cgtg 
gg tegegcag 
tgagggcgga 
cgegcaya ta 
gcT-gecgggt 
ggceggteae 
ggctgecgac 
ccaggtccgt 
eggagegegg 
gaggctgggt 
eg aag tegee 
ctagga gceg 
gecgacgegg 
gtagacgect 
gtaoayaeog 
gcggccgtcg 
geggaegtag 
ategctgteg 
gtgggqaagc 
gaecgee cgt 
ategtagatg 
ggc tyceaga 
tga ggegagg 
ga tgect tea 
gcg tag cteg 



ggggategcg 
caegtega ta 
gcggccgcg t 
tgtegggaae 
agaggttcae 
ecacggccgc 
ecacctggao 
cggtctcctc 
acgcgeacgc 
acgtteea tg 
aatgggccat 
yyatytctec 
gaa tgectet 
cccggtgagc 
ccccaggacg 
eacc tec teg 
gegtgtggtg 
ttgctcgaeg 
ctcgaeeact 
g tgg tcaegy 
gaagttgccc 
gctgtgcg tg 
ctcggcgctg 
gecgctgggg 
tgccceegcg 
gaa ct get eg 
gggtycgatg 
ggtt teg gec 
gtgeeegtet 
ggygtiaggtg 
teyttccagt 
ggcg tccagg 
aaggetgecg 
ttegaaceac 
gageggcqga 
gtegeggeca 
ttgggctega 
tcyggy tegg 
ctgtcegggt 
cgqgcgtagc 
tcga g tec ct 
gtcagtagcg 
ggtgeetgcg 
gg tgcegg tt 
gggtcegett 
gcg a ggt tgg 
cccacttcct 
ctgtgtgcgc 
ctg ttctegg 



ggaq tggata 
ggcac teg tg 
q a tea g ccgq 
gecagggacy 
g gec a eg tgq 
gtteeaegag 
ca.gtgct.tcg 
eagegtqaaa 
ytcgaecgeg 
tteggegacy 
teccceg tgg 
gecygctyeg 
ectegtagtg 
tgtcceggec 
tca.ee tegee 
gecagggegg 

qagqtqtcqt 
ggga tgtaqc 
gggtega teg 
a eg a tge tea 
gect tga. go t 
gagtectygt 
aagcgeccca 
tcga yet tga 
tccaggcagc 

aecqcc tget 
cgeaectggt 
gegaagggga 
gcg a a get eg 
ogtgtcaqgg 
gttecctggg 
ategqeteag 
eg ea a a get t 
a cgtg c tga L 
cggtggagga 
tggtgcggyg 
agteeegg r.c 
ecgetteggc 
gcg ge t tg tt 
egggatet tc 
age tea tgge 
tgatgtgtyt 
etegg ccaaa 
ccgcagggca 
ggtagaaett 
gg egg a ce eg 
aceaggtget. 
ggtegtgyag 



780 

84 0 
900 

9 6 0 

10 2 0 
1080 
.114 0 
1200 
1260 
1320 

13 80 

1 4 4 0 
.1500 

15 60 
1620 
.16 80 
1740 
1800 
18 60 
1920 
1980 
204 0 
2100 
2160 
2220 
2280 
2 3 4 0 
24 00 
2 4 6 0 
2520 
2580 
2640 
27 00 
2760 
2820 
2 8 80 

2 94 0 

3 000 
3 f) 6 0 
3120 
31 80 
324 0 
3 30 0 
3 360 
34 20 
3 4 80 
3 54 0 
3 600 
3660 
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RAW SEQUENCE LISTING 

PATRNT APPLICATION : US/09/4 7 8, 188 



DATE: .12/2 9/2000 
TIME: 11:55:07 



.189 
190 
191 
192 
19 3 
194 
195 
196 
197 

1 9 a 
199 
2 00 
2 01 

20 2 
2 0 3 
2 04 
205 
2 0 6 
20 7 
208 
209 
210 
211 
212 
2 13 
214 
2 1 5 
210 
217 
218 
219 
2 20 
2 2 1 
222 
223 
2 24 
22 5 
226 
227 
2 28 
229 
2 30 
231 
232 
233 
2 34 
2 35 
2 36 
2 37 



gcaqcv.-teq 
cja t.gcggacc 
t toqqtyaqc 
gtcqat Cttq 
gaaggegaaq 
cytgyytgty 

uqteaqcaqq 
qatgaqqeqq 
gtccatgagq 
q tcgagyaga 
cccqacgtca 
ecct teuyeg 
qtCCtqgqqq 

ca taeygecg 
g tcgaggytg 
ottca t aeeg 
eaqecoyegy 
a q cttLer eg 
ctocaoglge 
q qtcaq qctq 

ayeggoctgg 
otgagogyeg 
teyagatgga 
eqgacagag L 
ccqqccgaeg 
acq ay eg gee 
tegteeqtea 
gt caa yea to 
taqggqgttg 
qq tcccttgg 
teggtqaggq 

q gqgegqaqa 

gttttyaygg 
agaaageagt 
geggaag t tg 
ccgtcgccgc 
gtaatcgt.gc 
oggayagygt 
ca tttacccc 
gtygcqgatt 
eg accco tat 
agaaoggecg 
cgaac tacec 
eggeggagtt 
ggacca tccc 
get acta eye 
tcttcacc tc 
tg g 'tgg a gga 
cgetgecget 



input Set 
Output Set 

gegaggaage 
eggtegggge 
caytycagqa 
cgttcgtcyy 
ccyccyg tgg 
aaggagaaa t 
eggacgegey 
cyyecgtgqa 
aaacccgq \g 
aeggatttye 
ecggtgatgy 
t cactgccga 
gaggegctgg 
ttgcggaggt 
tegggatega 
q tegtcgaca 
qyatcgctgc 
cc tceygccc 
cggaactggt 
qcqggaatc t 
acaggy agya 
geagtt.gcgt 
geggteggge 
gecaggygea 
gggggaagca 
eggtaegggg 
gettgegtet 
tttcgtgaca 
tcgggtgggg 
gtoccteocc 
gccccggt.gg 
gaytttcyg t 
gtyaaaaagg 
atgacgata t 
agt.ccctt.ca 
tcegeycagg 
agtgccccct 
octagaaecc 
atgggggcgc 
gegoatgyea 
ccagcgtccg 
gtcoeeggag 
gttccccacc 
oyagegogco 
cgagaaggtg 
cacogacagc 
ggtggaggcc 
cctcgacocc 
cacggtcacc 



A:\M-8960-l . app 

N : \CRF3\1229-2000\I4 7 8188 . raw 



ggccgatgtg 
cggcqagtgt 
tcccggggcc 
ggacga tecy 
agacctcgyg 
agtcctgceg 
ectegtegaa 
ytteggtgay 
gggctgcgtc 
egttctttcc 
agtagecgay 
aggtgtcttc 
tggcgcggqa 
cgaccactcc 
gygayaggto 
qggtgcggcg 
cgqgca tctc 
gc t tec a cog 
caogytagag 
cgcccgcc t;c 
gcggogctgg 
caaagogagg 
gytccccgct 
ycgeatgtge 
qgggccqqca 
ggaagyge to 
ggc ttcagee 
cteggegayy 
accgegecte 
ggatcgegge 
ayggae tgag 
ccctgcgttg 
gactgaaygg 
cgycgcetac 
gtccct:ttt.c 
gaccgaagag 
cccccgtttc 
ctcaggggcc 
ttgggggcgt 
.ggaga tgeco 
ctgatcctca 
ca ycegcagg 
tat. g a eg ggc 
cacecccgat 
gcgcgcgcca 
cay tt ggc gc 
gcgttggccg 
g tggtgcgcg 
aeca tgetet 



g tcctcgyt q 
gtcgcyggLq 
ctcqtcctqg 
ttcgaaggqc 
gcgglggttg 
eatgeggeqg 

qcqqtcqttq 

ctcgytygag 
ggcgta gtcq 
ctggccgtgy 
aaggaqg tgg 
gagyaaacqg 
gtggaagtcc 
gtcaggggtg 
g ggaga ygee 
tttqtqqtq q 
c tccgccatc 
gtagccgt.ee 
aeggacqaag 
ccaggcgqtc 
ggccgyggtg 
gccctcggcg 
gegggaaegg 
gggggga caa 
acegggt ggc 
gtctctocyt 
tcetqacecc 
gactgaagyg 
gac tccc egg 
agggacccaa 
qg tctgta tg 
a g tccc tgg t 
actcaacttc 
a tacgegege 
gtggggtcgt 
ggaccaagte 
ccacagcgag 
ytt.ctcftggc 
caggagggct 
cgacagcggc 
ggaggcagac 
aagageggat 
gtgact tcct 
aeegggtega 
ccgcggaggg 
ggyacyogoy 
geeggaegga 
ac tcctactc 
gctacgeeaa 



ttcgegtaty 
gegaggtage 
acgayttcg a 
aggaggogq a 
cccagca gec 
gecttga tct 
ggc t tqagct 
tyttcqgag t 
ecyagaa tct 
agaaagggca 
aggaagtcga 
t.gccagcggg 
cgggtggggt; 
caeagygegt 
tttycctggg 
t qcagttcce 
te teeggcag 
ca ggaqtaoc 
aqcttggcyt 
qegqeqaogg 
g tttegaggg 
e tgetget ca 
ca tgaatgat 
eggeccg It: t 
ggqgcggcgt 
ggggeggcac 
caa taaggcg 
actgtcttte 
oggaegggat 
gggggcggtg 
gagcgat:aag 
ca tcaccgca 
cccattatga 
gtaoatag tg 
atcccc tc tg 
cc tgcycgyg 
teg tcgct.ee 
ectctggg cc 
tgtgaggget 
egggaatega 
ettgeagget 
cgtcctggac 
cgctccgctg 
ca I caa eg go 
caggee tccg 
cay goecyac 
qataetgy ga 
gttcgggyqc 
c tec tccc tc 



cgctggcygc 
ggcgggcccc 
cayccagqtt 
tgcggcgcca 
acagcttgtg 
tgtcaccgcc 
cgctgcacac 
a tgcgccacg 
gga tcatcac 
gcacetgcgc 
tcatctcccg 
qyqtgggga t 
egqgettgeg 
aggggtctec 
tgag gagege 
qgtcgqtgaa 
eecacaqgge 
agcccaggcc 
tgccgcggtc 
gggecteggy 
eeagcatctg 
tgyaeytcct 
cttcceggtg 
cygacgaggg 
qagegaggge 
gttgtygtcc 
aaagetgety 
gga at gag tg 
c tg ttcggtc 
eggegggegg 
agggtctgaa 
ggtcagaggg 
gctgagtaya 
agctta taa t 
actgcgttga 
gcgggegacg 
cctg tyagyc 
r.cetectgyc 
ctgccgggaa 
cgatgtccec 
ecagaagega 
gta tggctgg 
egcgaqcggg 
cacgacttct 
cacatagegg 
gggaagcegg 
cacccggtgg 
gagttyg tgt 
ctcgcgcgcg 



3720 
3 780 
3 84 0 
3900 

3 960 
4020 
4080 
414 0 

4 200 
4 260 
4320 
4 3 80 
44 40 
4 500 
4 55 0 
4 6 20 
4680 
4 74 0 
4 8 0 0 
4 86 0 
4 92 0 

4 9 80 
504 0 
5.100 
516 0 
5220 
5280 

5 340 
54 00 

54 60 
5520 

55 80 
5640 
5700 
5760 
5820 
5880 

59 4 0 

6 0 00 

60 6 0 
6 120 
6180 
624 0 
6 3 00 
6 360 
6420 
6480 
6 54 0 
6600 
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RAW SEQUENCE LISTING 
PATENT APPLICATION: US/09/4 7 8,188 

T il pu L Set : A:\M~8960-l.app 

Output Set: N:\CRF3\12 29 2000\I47818 8.raw 



OATK: .12/29/2000 
TIME: 11:55:08 



23B 
239 
240 
24 I 
242 
24 3 
2-4-1 
245 
24 6 

24 7 
248 
249 
250 
251 
2 52 

25 3 
254 
2 55 
2 5 6 

25 7 
258 
259 
260 
261 
262 

26 3 
264 
26 5 
266 
26 7 
268 
26 9 
270 
271 
272 
273 
274 
275 
276 
277 
278 
279 
28 0 
281 
282 
28 3 
284 
285 
286 



ccgg ty ttec 
ccagcgtcga 
agcagqceg t 
ccqecaegac 
acc tcea tya 
teg a g get t L 
da c tea tec a 
a eg cyaagqc 
gactcgacga 
ecaacgeegc 
aactgc tgga 
agttgaaqqc 
ceg tcaaeqa 
cyaaggcett 
gqaaecgetc 
gcgccgccgc 
caggaegtay 
ggagyaa f.cc 
ceg teceecg 
q tqctcgatq 
ggtgteaaeg 
eg tgcggtge 
aagcggctgg 
caycagggcc 
cgcgatcgac 
acagacacog 
yeggcect gg 
gccggtetcc 
tgtgyagcyc 
ctcca tgaat 
gg tea gca ca 
ctcca egg tc 
cagaycctcc 
gatctcgcce 
ga teaeatcg 
caycaygteg 
gatgtacgtg 
ctcgcccccc 
cacga tccge 
gacgecgyeg 
ct tgeegteg 
acagactccg 
qaggtcgegc 
gaactcggtg 
cagctcctcg 
ccagatctgc 
cagggact tc 
g tegcagtec 
ggacggcccg 



ggayttgece 

cggggggccc 
cgcecttcay 
qgtqgaey tc 
gey eggecat 
eg Cecayeag 
qgeegg tgea 
cccyytageg 
gaeeaegcag 
ggac tggcac 
agegacagg e 
g teegaeegg 
qgtca tea cc 
cgecgaagcc 
eggatecgyg 
ecgctttccc 
qy teaaeeey 
gegqegagea 
tegggaeeca 
accacqaegy 
toeyacatg L 
agetggtegg 
cceagqctga 
ttc tcgqtga 
t tie e eg eg a a 
cagtgcgtgg 
cactcet: ego 
ttc gcgaaca 
gacgaggeyg 
gccccggaga 
ccqgtgggea 
agctcccccg 
ccygtccy gg 
ccqtgcactc 
ggg teg tget 
ttgagccgcc 
agcccggeca 
gagaggtegg 
teeagygecg 
ageace tceg 
ateeggaegt 
teggtgaega 
Ctgaggttga 
etettggtet 
qcgctgtact 
caq teeggqe 
gageggtcea 
gggcaca tgc 
tcgtccttcy 



egtucctggg 
gg Lea egg a a 
aaeggggtye 
aca (" eggaey 
taectctaca 
a agg tegca t 
eaqgccggt t 
g gee age cog 
gaegggctgc 
egcaccaaeg 
tggt Leg acc 
tcaeeqqeqq 
geageqatqg 
ggeytggccg 
acc ec etc eg 
qgcqgggca c 
eetycycett 
gg tcettegg 
ggtcgatgat 
tyt tecegqe 
gc a gee egg t 
eaagtttgat 
ggtacccaag 
agaactegac 
qctgqtgctc 
tear; egg a tc 
acgacecett 
gcttgegcag 
cgategegga 
teaggctget 
egqeeaegga 
tgqqcqqqcq 
tcttcgcctt 
ccgccccggg 
cgaogaccag 
ecaegtzegeg 
gaecaetgec 
ccy tgygect 
tqegegegge 
tg agg tegeg 
egagegegge 
aacqttcgat 
gecget cgaa 
tcagegteae 
eggegategg 
tacceacett 
gca tct tgte 
cctgqgggtc 
teg '.gecgaa 



algaggtcga 
tcacc tgggc 
tgaecqatea 
aga tgctgga 
e ggg egg gee 
t cacctt cga 
tcgagg tege 
tctegggaqa 
tcyctctcac 
g tttcg tacc 
qceggeegca 
c get eg gey c 
a eg at g tec t 
eecag caaet 
eey tetg ay a 
tggecgqggq 
eaggtygegg 
tgtgccgqtg 
ceagteqgee 
cteqacqagc 
ggtgggcteq 
ecqetgcag t 
aecgaegteg 
ggcetcg teg 
caggaectcg 
catgaaggee 
qgagtt gaag 
egggtcca tc 
ctygtcgaea 
ettgeeggaa 
qacetgette 
gacetect.ee 
cegcaycttc 
accgacateg 
caegg tgttc 
eggytycayg 
gaggtygegc 
ytccaggg tc 
tttcg eg a ga 
gaccteea tg 
ggcgttgagc 
gacctcgcgc 
c egg teg gee 
ettcccgecg 
cttggccgga 
gtactcgggg 
eaggtcgagg 
g ttgaacgag 
ceg tgcgaae 



ageagectge 
co a eg a eg gc 
qgaeaacqge 
etgqgtccge 
etegqactyg 
ctegtccaag 
qqtq ttceeg 
cteectgtgg 
ccagt acc tg 
gg tgaceggc 
gcaaegggtg 
qetqeteqqe 
gegcag tgga 
getcqa tgec 
tccyytaccy 
a eat get etc 
cycagatac t 
aaqacqatct 
tqctgeaeea 
cegtccagga 
tccaygacat 
tcacegcegg 
acgagagegc 
gegggcaget 
ggettgaagc 
agcteggtga 
ctgaacagcg 
aggeegaggt 
aagaccgegt 
eccgecaccc 
agg ttgtgga 
tteaeqegqq 
gegaaggacc 
acgatgtggt; 
ccc ttgtegc 
ccgatgctgg 
acc a tct tea 
agg tagceya 
gqqgcagegg 
ctcgagtagt 
egcge-gcccc 
ttgegytege 
aacecctegt 
gtgeegegea 
tccagacggc 
aaaaggaccg 
gegatgetet 
aacgeggaga 
agggecegga 



caggegytgg 
Lgygttttcc 
cgctceqget 
t qq tggaege 
ygcggygcgt 
gecgeeeggy 
t tgeccaqga 
ctqqceqcgg 
ateagecegq 
g egg ccgg gg 
yecgygqage 
qactteqeqq 
yegg a ccc eg 
tacaaegecc 
ggyeacdygg 
ccqcccecqq 
eaceggtcaq 
eg ceg ccc tc 
ca tcgagg tt 
q cttcagcaq 
aqaceqtqce 
agaggetgqa 
geagtttegg 
ceaggaegtc 
qgcgcccete 
tga t.gaecec 
agg eye Lege 
aggagaccgq 
cggggtgcq e 
egg tca.ee gc 
gatccqcg! t 
ceccceqceq 
cctegaacac 
eggega tctc 
geagegegeg 
gctcg tcgaa 
gccgetgccc 
ycecga tgya 
ccgqetcegt 
egg eg a tg t t 
ggcaggagqg 
teagegeget 
agt teg tct g 
gca gey tg tc 
egg act tege 
ccecgtcgtc 
gg ceg ay acc 
cqeegagega 
teatcgye tg 



6660 
6 7 20 
6 7 80 
6 84 0 
6 9 0 0 

6 9 6 0 
7020 
7080 
714 0 

7 2 00 
7260 
7 3 2 0 
7 3 80 
7 4 41) 
7500 
7 5 6 0 
7 6 20 
7 6 8 0 
7 7 40 
7800 
7 860 
7 9 2 0 

7 9 8 0 

8 0 4 0 
8100 
8160 
8 2 20 
82 8 0 
8 3 4 0 
8 4 00 
8 4 6 0 
8520 
8580 
8 6 A 0 
8 7 00 
8 7 60 
8 8 20 
88 80 

8 9 4 0 
90 00 

9 0 60 
9120 
9180 
9 24 0 
9 3 00 
9 3 6 0 
94 20 
94 8 0 
9 5 40 
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<210> 101 
<212> DNA~~ 
<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 



<220> 

<223> partial ORF 
<400> 101 



<210> 102 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<2/<D> /o/ 
<yoo> /o/ 

ooo 
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VERIFICATION SUMMARY DATE: .12/29/2000 

PATENT APPLICATION: US/09/4 78,18 8 TIME: 11:55:09 

Input Set : A:\M-8960-l.app 

Output Set: N:\CRF3\12292000\I4 78188.raw 

L:2435 M:282 W: Numeric Field Identifier Missing, <211> is required. 
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